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Abstract. Many networks extent in space, may it be metric (e.g. geographic) or non- metric (ordinal). 
Spatial network growth, which depends on the distance between nodes, can generate a wide range of 
topologies from small-world to linear scale-free networks. However, networks often lacked multiple clusters 
or communities. Multiple clusters can be generated, however, if there are time windows during development. 
Time windows ensure that regions of the network develop connections at different points in time. This novel 
approach could generate small-world but not scale-free networks. The resulting topology depended critically 
on the overlap of time windows as well as on the position of pioneer nodes. 
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1 Introduction 



such as highway transportation networks [5] , the Internet 
[TP], or social networks 0[TD]. 
Spatial networks often display high neighborhood connectivity — 
features of small-world networks pQ — on the local and mul- 



tiple clusters at the global level of organization. Clusters 
indicate sets of nodes with a higher proportion of edges 
within the same cluster than between different clusters. 
Note, that small-world networks may or may not consist 
of multiple clusters. For example, small-world networks 
generated by rewiring regular networks as in [1] do not 
show multiple clusters. 

Whereas methods leading to small-world connectivity 
have been described before, there has been no systematic 
evaluation how multiple clusters arise during the growth of 
spatial networks. For example, in [2J we described spatial 
growth where the establishment of connections depended 
on the spatial distance between nodes. Whereas this ap- 
proach yielded a wide range of topologies from small- world 
to linear scale-free, there was no guaranty that multiple 
clusters or communities would arise. 

Here, we show that multiple clusters can be reliably 
generated following the concept of time windows. Many 
real- world networks show an underlying time window struc- 
ture - that is, several parts of the network develop in 
certain stages of network growth, which are called their 
associated time windows. This is a well-known fact for 
the development of neural and cortical networks [2UMS], 
or for the evolution of non-spatial networks such as social 
networks, the Internet, or the World- Wide- Web. 

We describe spatial growth with time windows in three 
dimensions in analogy to brain networks. However, similar 
considerations hold for two-dimensional spatial networks 



2 Methods 

In our algorithm, the establishment of an edge depends on 
the distance between the nodes ffi lTTlIT^] and the current 
likelihood of establishing a connection given by the time 
windows of both nodes. The distance-dependent probabil- 
ity is 

Pdist = 13 d (1) 

where d is the spatial Euclidean distance between two 
nodes, 7 = 6, and (3 = 6. The effect of varying 7, and 
(3 has been described previously [2]. 

The time-dependent probability Ptimc of a node is in- 
fluenced by its distance to pioneer nodes (N\ ...Nk € R 3 
where k € N is the desired number of time windows). The 
reasoning is that nodes originate from other nodes in the 
region thereby inheriting their time domains from previous 
nodes (Fig. [IJi). These regions are the basis for network 
clusters (Fig.Q}j). Each node has a preferred time for con- 
nection establishment and the probability decays with the 
temporal distance to that time (see appendix E] Fig. [8]). 

A highly symmetric placement of the pioneer nodes 
is desirable. Therefore, their coordinates are chosen de- 
terministically (i.e. only depending on fc) in the following 
way: When it is possible, they are placed at the vertices 
of a platonic polyhedron (that is, for k = 4, 6, 8), and oth- 
erwise equidistantly on a circle. 
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Fig. 1. (a) Two-dimensional projection of three-dimensional 
node positions. The grey levels represent the time window cor- 
responding to one of the three seed nodes (+). (b) Adjacency 
matrix after the clustering algorithm has been applied showing 
three clusters. 

As the time t in the simulation progresses from to 
1, new nodes and edges are added to the network in the 
following manner: (1) Coordinates for a new node U are 
chosen randomly (maximal distance to the origin is two 
units). (2) U is associated to the same time window as 
the nearest pioneer node, denoted by w(U). (3) To each 
existing node V, a new edge is established with probability 

P = P dist (d(U, V)) ■ pW?\t) ■ P^ ] \t) (2) 

where d(U, V) denotes the Euclidean distance between the 
nodes U and V. (4) If no edge to any existing nodes can be 
established, the new node is discarded. In that case, the 
time remains unchanged and steps (l)-(4) are repeated 
until a non-isolated node appears. This procedure ensures 
that the number of nodes grows linearly with time and 
that the final network contains the desired number of 
nodes. 

With this implementation, the network topology was 
studied under the variation of the final number of nodes 
from 50 to 400 in steps of 50, the number of time win- 
dows from 3 to 10, and the "integral parameter" a of 
the time window functions, indicating the width of the 
time window (see appendix lA"]) . from 0.1 to 0.6 in steps 
of 0.05. For each of the resulting parameter vectors, 40 
networks were generated. The mean as well as the stan- 
dard error of the mean was stored for each parameter 
vector. For simulations, we used MATLAB (Release 14, 
MathWorks Inc., Natick). Routines are available online 
( |http: / /www.b iological -networks.org[ ) . 

3 Results 

Significant correlations (R 2 > 0.95) of the measured val- 
ues to the three parameters are listed and a functional 
correlation value w /(parameter) will be given. This rela- 
tion means that the arithmetic mean over the parameter 
space of the other two parameters is calculated, and the 
resulting data is used for a correlation analysis. 



Clustering coefficient (CC) 

The clustering coefficient, calculated according to p], ranges 
from 0.19 to 0.38 (Fig. [2^). For three time windows, the 
CC is slightly negatively correlated with the number of 
nodes and almost independent from a, varying between 
0.21 and 0.26. For the other cases, CC positively corre- 
lates with a and negatively correlates with the number of 
nodes. 



Average shortest path (ASP) 

For three time windows, the ASP of the generated net- 
works varies slightly around a mean of 3.45, almost inde- 
pendently of the other parameters. For more than three 
time windows, the ASP is nearly independent of the num- 
ber of time windows, yet negatively correlated with the 
number of nodes, and a (ASP » 2.33 a~ 025 ), varying in 
a range between 2.3 and 4.9 (Fig. f2b). 

Degree distribution 

For all cases, the degree distribution does not follow a 
power law indicating that this model might not be able 
to produce scale- free networks [13] ■ Figure [3J shows plots 
of the maximum and median values of the node degrees. 
The maxima are linearly correlated with the number of 
nodes. They are also linearly correlated to the integral for 
more than three time windows. The median of the degrees 
is correlated by a power law to the number of nodes with 
y w 0.69 x 0A0 in the case of three time windows, and y ~ 
0.98 x 43 in the other cases. 



Edge density and multiple clusters 

For three time windows, the number of edges is only posi- 
tively correlated to the number of nodes (as expected, the 
number of edges grows like the square of the number of 
nodes), ranging from 90 to 2550. For the other cases, the 
number of edges ranges from 70 to 6550 and is positively 
correlated with the number of nodes N (square), as well 
as with a (linearly). 

As the generated networks show multiple clusters, it is 
important to know how densely the clusters are connected 
to each other. To get a single value still describing this in- 
terconnectedness, a measurement of " connections between 
the extremes" was introduced. Namely, for each simula- 
tion, a clustering algorithm (see appendix |C|) was applied 
to the adjacency matrix. The interconnectivity was then 
measured as the ratio of edges that connect the first and 
last quartile of nodes in the clustered adjacency matrix 
(F g ). This indicates how well the first and the last clus- 
ter of a network were connected. For the case of three 
time windows, this cluster interconnectivity was very low, 
between 1.0% and 1.9%, showing a small negative correla- 
tion with the number of nodes. For the case of more than 
three time windows, there is a clear negative correlation 
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Fig. 2. (a) Clustering coefficient, (b) Average shortest path 

length. Fig. 3. Maximum and median degrees of nodes 
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Fig. 4. Mean value of cluster connectedness F q for more than 
three time windows, plotted over integral size a. 

with the number of nodes (F q w 1 /N + C) . The constant 
C depends on the range of a and the number of time win- 
dows over which the mean is calculated, but is significantly 
larger than zero (for the parameter ranges used here, we 
get on average C w 0.16). As shown in Fig. [4] the mean 
value of F q varies with a, showing a local minimum for 
a = 0.25. 



Small world properties 

We tested for small- world features by comparing the clus- 
tering coefficients and average shortest paths with those of 
random benchmark networks (Fig. [5|). For each network 
generated by spatial growth, 20 random networks were 
generated, and the mean parameter values were compared. 
A network was considered to be small- world if ASP gon / ASP : 
1.3 and CC gen /CC ran d > 4. For three time windows, small 
world properties remain stable when changing the number 
of nodes and a. For a higher number of time windows, the 
small-world feature is negatively correlated with a (Fig. 
Et). We have CC gcn /CC rand « 0.75 N - 35 , CC gcn /CC rand « 
2.6/y/N, ASP g en/ASP ran d ~ y/a, and a correlation be- 
tween a and ASP gcn /ASP ran d, with a local maximum for 
a = 0.2 and a local minimum at a — 0.35 (Fig. [SJa). 



Time-based growth analysis 

As our spatial networks grow over time, we also tested 
how network features evolved over time. For the same 
time window and a range as before, networks with 400 
nodes were generated. For each step of 50 generated nodes, 
we observed the total number of edges as well as the 
number of inter-class edges between nodes with differ- 
ent time- window preferences for connection establishment. 
This tested whether there is a growth stage in which the 
generation of local (within clusters), or global (between 
clusters) connections occurs. In the results, again the three 
time window case is interesting: independently of a, the 
majority of inter-class edges are established at the very 



d < 




Fig. 5. ASP and CC ratios of the generated spatial networks 
relative to comparable random networks. 
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Fig. 6. (a) Networks of small world type (gray) (threshold: 
ASP g en/ASP rand < 1.3 and CC gc „/CC rand > 4); (b) Correla- 
tion between a and ASPgen/ASPrand 
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beginning and the very end of growth, as well as in a very 
limited period when half of the final number of nodes were 
established (Fig. [7]) . For more than three time windows — 
apart from some artifacts produced by pioneer node geometryfer temporal growth of spatial networks depending both 
the inter-class growth did mainly occur around the half on the distance between nodes as well as on time do- 
time of development. mains for connection development. For neural systems, 



Fig. 7. Ratio of inter-class edges at different stages (number 
of generated nodes) of the network growth. 



4 Discussion and conclusion 

The investigated approach was able to generate small- 
world multiple-cluster networks. We found that network 
topology is mainly influenced by the number of time win- 
dows and the spatial position of pioneer nodes and thus 
time domains. 

The case of three time windows behaves remarkably 
different from the other numbers of time windows. A po- 
tential explanation could be that the low overlap of the 
time windows (see appendix [B]) allows only for a very 
slow network growth, in the sense that most nodes are 
discarded as they were not able to link to the existing 
network. 

Another critical parameter is the spatial position of pi- 
oneer nodes. For the connectivity within and between clus- 
ters, "artifacts" can be seen independently from the num- 
ber of time windows. This gives evidence that these val- 
ues depend on the actual placement of the pioneer nodes. 
Therefore, the interconnectivity and the size of network 
clusters could be adjusted by merely changing the place- 
ment of pioneer nodes while preserving the number of time 
windows. 

In conclusion, we have presented a general framework 
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the time preference for establishing connections might be 
given by external/epigenetic factors in the surrounding 
medium or from genetic factors inherited from common 
precursor cells. Future work will have to investigate the 
role of pioneer node placement and mixing differently shaped, 
e.g. wide and narrow, time-windows. 



A Time window functions 



The time window function had to meet the following cri- 
teria: (1) The connection probability is zero at the start 
and end of development, i.e. iv^ e (0) 

anyz, (2) p£ e (/x«) = 1 
differentiable, and (4) In a set of time window functions 

P t ^ c , i = 1 ... k, the integral := /„* P^ e (t) dt should 
be the same for any i. 

These criteria were met for 



= 41(1) = for 
(3) 41 should be continuously 



1 



16 



log(2) 



with M W := A 
of the integral, so that Jg P^ e (t) dt 



(3) 

and a as the desired value 



i°gO) 

a (Fig. and 

En(a) being a numerically determined scaling factor to 
get the desired integral value a. 




B The Overlap index 



Fig. 8. Plot of P t ' l ,] lc for five time windows with (a) a = 0.2, 
or (b) a = 0.4. 



For the time window functions given in equation^ or time 
window functions in general, the overlap between pW and 
P<j), i ^ is, in some sense, a measure of how many 
connections are on average established between the corre- 
spondent node classes in a simulation. The definition of 
the overlap index Q shall make this heuristic observation 
more precise. First of all, we will introduce the overlap 
Ov(pW,P")) for two time window functions. Let us de- 
fine 



Ov(pW,pC0) := 



fip(*)(t)pW(t)dt 



y/ si (pim* dt.ji(pu)(t)) 2 dt 

(pW,p(i)) 



|p(« 



IPO) I 



(4) 



where (•, •) is the standard scalar product on the Euclidean 
vector space of integrable real valued functions on the in- 
terval [0, 1], and || • || 2 is the associated norm. 

What we still need is a global measure Q for the over- 
lap, given a set P^ 1 ) . . . P^ k ' of time window functions 
used in a simulation. For this, we will take the average of 
Ov(pW , P^)) for i ^ j, weighted by the difference of time 
window indices (j — i). This takes into account the fact 
that usually the maxima of the time window functions are 



placed at equal distance on the unit interval (remember 
we defined fi^ 1 ' :— jgrj). So we define 



f):=±. (j-*)-Ov(pW,P«) 



(5) 



l<i<j<k 



What remains is to calculate the sum of the weights, Sfe. 
This is done easily: 

k k k 

Sk= = X/( fc ~*) = fe X/~X/ 2 

l<i<j<k i=l i=l i=l (6) 

_ fc 2 (fc - 1) _ fc(fc — l)(2fc — 1) 
~~ 2 6 
So finally we get 



fc 2 (fc-l) _ fc(fc-l)(2fc-l) 
2 6 



(7) 

Figure [5] shows a plot of the Overlap index £1 for the case 
of the standard time window functions defined in equation 
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G2 where the number of time windows runs from 3 to 10, 
and the value of the integral, a, runs from 0.1 to 0.6. 




Fig. 9. Overlap index O of the standard time window functions 
depending on number of time windows, k, and the integral a. 



C The clustering algorithm 

For the measurement of cluster interconnectivity F q , a 
simple clustering algorithm was applied to the adjacency 
matrix. 

Given the adjacency matrix representation (a^) of a 
network, each element ay stores a Boolean value saying 
whether the edge (i,j) exists in the network: 

{1 nodes i and j are connected 
(8) 
otherwise 

We use a cosine-based similarity definition. Nodes are 
thought of as vectors in n dimensional space, where n is 
the number of nodes. Node i is represented by the ith 
column vector i of the adjacency matrix. The similarity 
between two nodes is measured by computing the cosine 
of the angle between the associated vectors. The similarity 
between the nodes i and j is given by 

m i ,=c sZ(ii) = ^ j (9) 

We call rriij the connectivity matching index between the 
nodes i and j (i ^ j). It is a measurement for the similar- 
ity of their connection patterns. The entries my form the 
connectivity matching matrix M . 

Two nodes with a high matching index show a similar 
linking pattern. Therefore they are likely to be within the 
same cluster. 

The node i with the highest number of connections 
to other nodes is chosen as initial node. It is the first 
node of the newly arranged network. The node j with the 
highest matching index regarding node one is chosen 



to be node two. The node with the highest matching index 
regarding node two is labelled node three. The process 
continues until all nodes are labelled. A newly ordered 
adjacency matrix Ac is then generated. 

Marcus Kaiser was supported by EPSRC (EP/E002331/1). We 
thank Freya Gnam for developing the clustering algorithm de- 
scribed in appendix [C] 
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